TITLE

Granulocyte-Colony Stimulating Factor Receptors

CROSS-REFERENCE TO RELATED APPLICATION

This application is a continuation-in-part of U.S. Application Serial No. 522,952,
filed April 3, 1990, which is a continuation-in-part of U.S. Application Serial No.
416,306, filed October 3, 1989, which is a continuation-in-part of U.S. Application Serial
No. 412,816, filed on September 26, 1989.



BACKGROUND OF THE INVENTION

The present invention relates generally to cytokine receptors and more specifically
to granulocyte-colony stimulating factor receptors.

Human Granulocyte-Colony Stimulating Factor (G-CSF) is a lineage-specific
hematopoietic protein which stimulates the proliferation and differentiation of
granulocyte-committed progenitor cells. Human G-CSF has also been shown to functionally
activate mature neutrophils. The cDNAs for human (Nagata et al., Nature 319:415, 1986) and
mouse G-CSF (Tsuchiya et al., PNAS 83:7633, 1986) have been isolated, permitting
further structural and biological characterization of G-CSF.

G-CSF initiates its biological effect on cells by binding to specific G-CSF receptor
protein expressed on the plasma membrane of a G-CSF responsive cell. Because of the
ability of G-CSF to specifically bind G-CSF receptor (G-CSFR), purified G-CSFR
compositions will be useful in diagnostic assays for G-CSF, as well as in raising
antibodies to G-CSF receptor for use in diagnosis and therapy. In addition, purified
G-CSF receptor compositions may be used directly in therapy to bind or scavenge G-CSF,
thereby providing a means for regulating the immune activities of this cytokine. In order to
study the structural and biological characteristics of G-CSFR and the role played by
G-CSFR in the responses of various cell populations to G-CSF or other cytokine stimulation,
or to use G-CSFR effectively in therapy, diagnosis, or assay, purified compositions of
G-CSFR are needed. Such compositions, however, are obtainable in practical yields only by
cloning and expressing genes encoding the receptors using recombinant DNA technology.
Efforts to purify the G-CSFR molecule for use in biochemical analysis or to clone and
express mammalian genes encoding G-CSFR have been impeded by lack of a suitable
source of receptor protein or mRNA. Prior to the present invention, no cell lines were
known to express high levels of G-CSFR constitutively and continuously, which precluded
purification of receptor for sequencing or construction of genetic libraries for direct
expression cloning.

SUMMARY OF THE INVENTION

The present invention provides DNA sequences encoding mammalian
granulocyte-colony stimulating factor receptors (G-CSFR) or subunits thereof. Preferably,
such DNA sequences are selected from the group consisting of (a) cDNA clones having a
nucleotide sequence derived from the coding region of a native G-CSFR gene; (b) DNA
sequences which are capable of hybridization to the cDNA clones of (a) under moderately
stringent conditions and which encode biologically active G-CSFR molecules; and (c) DNA
sequences which are degenerate as a result of the genetic code to the DNA sequences
defined in (a) and (b) and which encode biologically active G-CSFR molecules. The
present invention also provides recombinant expression vectors comprising the DNA
sequences defined above, recombinant G-CSFR molecules produced using the recombinant
expression vectors, and processes for producing the recombinant G-CSFR molecules using
the expression vectors.

The present invention also provides isolated or purified protein compositions
comprising mammalian G-CSFR. Preferred G-CSFR proteins are soluble forms of the
native receptors.

The present invention also provides compositions for use in therapy, diagnosis,
assay of G-CSFR, or in raising antibodies to G-CSFR, comprising effective quantities of
soluble native or recombinant receptor proteins prepared according to the foregoing
processes. These and other aspects of the present invention will become evident upon
reference to the following detailed description.

BRIEF DESCRIPTION OF THE DRAWINGS

FIGURE 1 shows restriction maps of cDNA clones D-7 and 25-1 containing
regions encoding human G-CSFR proteins.

FIGURES 2-5 depict the cDNA sequence of clone D-7, which was isolated from a
human placental library, and the predicted amino acid sequence of this clone. The coding
region of the predicted mature full-length membrane-bound protein from clone D-7 is
defined by amino acids 1-759. The predicted N-terminal Glu of the mature protein is
designated amino acid number 1 and is underlined. The putative transmembrane region at
amino acids 604-629 is also underlined.

FIGURE 6 depicts the 3' nucleotide sequence and predicted C-terminal amino acid
sequence of clone 25-1, which is the result of an alternative splicing arrangement. The
position of the intron insertion in clone 25-1 is indicated with a ♦ after nucleotide 2411 of
Figure 1. The positions of the intron-exon boundaries are indicated with a *, and
splice-donor and splice-acceptor recognition sequences are boxed. Sequences also present in
clone D-7 are underlined.

DETAILED DESCRIPTION OF THE INVENTION

Definitions 

G-CSF is a growth factor which induces growth and differentiation of neutrophilic
granulocyte progenitors. The biological activities of G-CSF are mediated through binding
to specific cell surface receptors, referred to as "G-CSF receptors" or "G-CSFR".
G-CSFR, as used herein, refers to proteins having amino acid sequences which are
substantially similar to native mammalian G-CSFR amino acid sequences, such as the
human G-CSFR sequence disclosed in Figure 1, or fragments thereof, and which are
biologically active as defined below, in that they are capable of binding G-CSF molecules
or, in their native configuration as intact human plasma membrane proteins, transducing a
biological signal initiated by a G-CSF molecule binding to a cell, or cross-reacting with
anti-G-CSFR antibodies raised against G-CSFR from natural (i.e., nonrecombinant)
sources. Specific embodiments of G-CSFR include polypeptides substantially equivalent
to the sequence of amino acids 1-759 of Figures 2-5 (clone D-7) or the sequence of amino
acids 1-776 of the protein encoded by clone 25-1 as disclosed in Figures 2-5 and 6. The
terms "G-CSF receptor" or "G-CSFR" include, but are not limited to, soluble G-CSF
receptors, as defined below. As used throughout this specification, the term "mature"
means a protein expressed in a form lacking a leader sequence as may be present in
full-length transcripts of a native gene. Various bioequivalent protein and amino acid analogs
are described in detail below.

The mature N-terminal amino acid is predicted to be Glu1 (underlined and
designated as amino acid 1 in Figures 2-5), based on the algorithm of von Heijne, G.,
Nucl. Acids Res. 14:4683 (1986), for determining signal cleavage sites. However, several
factors suggest that Ser-3 may be the correct mature N-terminal amino acid: Ser-3 is
21 amino acids from the N-terminal Met and is preceded by the small amino acid residue
Gly, both of which are accepted criteria for identifying signal cleavage sites. The actual
N-terminal amino acid of the mature protein can be confirmed by sequencing purified
G-CSFR protein using standard techniques. Thus, amino acid sequences equivalent to
those described above include, for example, amino acids -3 through 759 of Figures 2-5
(clone D-7) or -3 through 776 of the protein encoded by clone 25-1 as disclosed in
Figures 2-5 and 6.

In their native configuration, receptor proteins are present as intact human plasma
membrane proteins having an extracellular region which binds to a ligand, a hydrophobic
transmembrane region which causes the protein to be immobilized within the plasma
membrane lipid bilayer, and a cytoplasmic or intracellular region which interacts with
cytoplasmic proteins and/or chemicals to deliver a biological signal to effector cells via a
cascade of chemical reactions within the cytoplasm of the cell. The hydrophobic



WO 91/05046

PCT/US90/05434

transmembrane region and a highly charged sequence of amino acids in the cytoplasmic
region immediately following the transmembrane region cooperatively function to halt
transport of the G-CSFR across the plasma membrane. "Soluble G-CSFR" or "sG-CSFR",
as used in the context of the present invention, refers to a protein, or a substantially
equivalent analog, having an amino acid sequence corresponding to the extracellular region
of native G-CSFR, for example polypeptides having amino acid sequences substantially
equivalent to the sequence of amino acids 1-603 of Figures 2-5. Equivalent sG-CSFRs
include polypeptides which vary from the sequences shown in Figures 2-5 by one or more
substitutions, deletions, or additions, and which retain the ability to bind G-CSF and
inhibit the ability of G-CSF to transduce a signal via cell surface bound G-CSF receptor
proteins. Because sG-CSFR proteins are devoid of a transmembrane region, they are
secreted from the host cell in which they are produced. Equivalent soluble G-CSFRs
include, for example, the sequence of amino acids -3 through 603 of Figures 2-5. When
administered in therapeutic formulations, sG-CSFR proteins circulate in the body and bind
to circulating G-CSF molecules, preventing interaction of G-CSF with natural G-CSF
receptors and inhibiting transduction of G-CSF-mediated biological signals, such as
immune or inflammatory responses. The ability of a polypeptide to inhibit G-CSF signal
transduction can be determined by transfecting cells with recombinant G-CSF receptor
DNAs to obtain recombinant receptor expression. The cells are then contacted with G-CSF
and the resulting metabolic effects examined. If an effect results which is attributable to the
action of the ligand, then the recombinant receptor has signal transducing activity.
Exemplary procedures for determining whether a polypeptide has signal transducing
activity are disclosed by Idzerda et al., J. Exp. Med. 171:861 (1990); Curtis et al., Proc.
Natl. Acad. Sci. USA 86:3045 (1989); Prywes et al., EMBO J. 5:2179 (1986); and Chou
et al., J. Biol. Chem. 262:1842 (1987). Alternatively, primary cells or cell lines which
express an endogenous G-CSF receptor and have a detectable biological response to
G-CSF could also be utilized.

"Substantially similar" G-CSFR include those whose amino acid or nucleic acid 
sequences vary from a reference sequence by one or more substitutions, deletions, or 
30 additions, the net effect of which is to retain biological activity of the G-CSFR protein. 
Alternatively, nucleic acid subunits and analogs are "substantially similar" to the specific 
DNA sequences disclosed herein if: (a) the DNA sequence is derived from the coding 
region of a native mammalian G-CSFR gene; (b) the DNA sequence is capable of 
hybridization to DNA sequences of (a) under moderately stringent conditions and which 
35 encode biologically active G-CSFR molecules; or DNA sequences which are degenerate as 
a result of the genetic code to the DNA sequences defined in (a) or (b) and which encode 
biologicaUy active G-CSFR molecules. Substantially similar analog proteins will be greater 
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than about 30 percent similar to the corresponding sequence of the native G-CSFR. 
Sequences having lesser degrees of similarity but comparable biological activity are 
considered to be equivalents. More preferably, the analog proteins will be greater than 
about 80 percent similar to the corresponding sequence of the native G-CSFR, in which 

5 case they are defined as being "substantially identical." In defining nucleic acid sequences, 
all subject nucleic acid sequences capable of encoding substantially similar annuo acid 
sequences are considered substantially similar to a reference nucldc add sequence. Percent 
similarity may be determined, for example, by comparing sequence information using tiie 
GAP computer program, version 6.0, available firom Ae University of Wisconsin Genetics 

10 Computer Group (UWGCG). The GAP program utilizes the alignment metiiod of 
Needleman and Wunsch (J. Mol. Biol. 48:443, 1970), as revised by Smith and Waterman 
(Arfv. Appl. Math. 2:482, 1981). Briefly, the GAP program defines similarity as tiie 
number of aligned symbols (i.e., nucleotides or amino acids) which are similar, divided by 
the total number of symbols in tiie shorter of tiie two sequences. The preferred defeult 

15 parameters for the GAP program include: (1) a unary comparison matrix (containing a 
value of 1 for identities and 0 for non-identities) for nucleotides, and the weighted 
comparison matrix of Gribskov :.r,d Burgess, NucL Acids Res. 74:6745, 1986, as 
described by Schwartz and Dayhoff, ed.. Atlas of Protein Sequence and Structure, 
National Biomedical Research Foundation, pp. 353-358, 1979; (2) a penalty of 3.0 for 

20 each gap and an additional 0.10 penalty for each symbol in each gap; and (3) fio penalty 
for end gaps. 
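
The similarity measure and gap penalties described above can be illustrated with a
short Python sketch. It is not the GAP program and performs no alignment of its own: it
only scores two sequences that have already been aligned, using the unary matrix that the
text specifies for nucleotides. The function names and the example sequences are
hypothetical.

```python
def percent_similarity(aligned_a: str, aligned_b: str) -> float:
    """Identical aligned symbols divided by the length of the shorter sequence."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Unary matrix: 1 for an identity, 0 otherwise; gap symbols never match.
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    shorter = min(len(aligned_a.replace("-", "")), len(aligned_b.replace("-", "")))
    return 100.0 * matches / shorter


def gap_penalty(aligned: str, per_gap: float = 3.0, per_symbol: float = 0.10) -> float:
    """Penalty of 3.0 per gap plus 0.10 per gap symbol; end gaps are free."""
    core = aligned.strip("-")  # leading and trailing (end) gaps are not penalized
    penalty, in_gap = 0.0, False
    for ch in core:
        if ch == "-":
            if not in_gap:
                penalty += per_gap  # a new gap opens
            penalty += per_symbol   # every symbol in the gap adds to the penalty
            in_gap = True
        else:
            in_gap = False
    return penalty
```

For the hypothetical alignment of ACGTACGT with ACG--CTT, five symbols are identical and
the shorter sequence has six symbols, so the similarity is about 83 percent; the single
two-symbol gap costs 3.0 + 2 x 0.10.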

"Recombinant," as used herein, means tiiat a protein is derived from recombinant 
(e.g., microbial or mammalian) expression systems. "Microbial" refers to recombinant 
proteins made in bacterial or fungal (e.g., yeast) expression systenis. As a produa, 

25 "recombinant microbial" defines a protein produced in a microbial ejqiression system which 
is essentially free of native endogenous substances. Protein e^qjressed in most bacterial 
cultures, e.g., E. coli, wiU be ftee of glycan. Protein expressed in yeast may have a 
glycosylation pattern different from that expressed in manamalian cells. 

"Biologically active," as used throughout the specification as a characteristic of G- 

30 CSF recq)tors, means that a particular molecule shares sufficient amino acid sequence 
similarity wi the embodiments of the present invention disclosed herein to be cqjable of 
binding dete. ible quantities of G-CSF, transmitting a G-CSF stimulus to a cell, for 
example, as a component of a hybrid receptor construct, or cross-reacting with anti-G- 
CSFR antibodies raised against G-CSFR from nanoral (ie., nonrecombinant) sources. 

35 Preferably, biologically active G-CSF receptors within the scope of tiie present invention 
are capable of binding greater tiian 0.1 nmoles G-CSF per nmole receptor, and most 
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preferably, greater than 0.5 nmole G-CSF per nmole receptor in standard binding assays 
(see below). 

"DNA sequoice" refers to a DNA polymer, in the form of a separate fragment or as 
a component of a larger DNA constract, which has been derived from DNA isolated at least 

5 once in substantially pure form, i.e., free of contaminating endogenous materials and in a 
quantity or concentration enabling identification, manipulation, and recovery of the 
sequence and its component nucleotide sequences by standard biochemical methods, for 
exaii5)le, using a cloning vector. Such sequences are preferably provided in die forai of an 
open reading frame uninterrapted by internal nontranslated sequences, or introns, which 

10 are typically present in eukaiyotic genes. Genomic DNA containing die relevant sequences 
could also be used. Sequences of non-translated DNA may be present 5* or 3' from the 
open reading frame, where the same do not interfere with manipulation or expression of the 
coding regions. 

"Nucleotide sequence" refers to a heteropolymer of deoxyribonucleotides. DNA 
15 sequences encoding the proteins provided by this invention can be assembled from cDNA 
fragments and short oligonucleotide linkers, or from a series of oligonucleotides, to 
provide a synthetic gene which is capable of being expressed in a recombinant 
transcriptional unit 

"Recombinant expression vector" refers to a replicable DNA construct used either to 

20 amplify or to express DNA which encodes G-CSFR and which includes a transcriptional 
unit comprising an assembly of (1) a genetic element or elements having a regulatory role in 
gene expression, for example, promoters or enhancers, (2) a strucmral or coding sequence 
which is transcribed into mRNA and translated into protein, and (3) appropriate 
transcription and translation initiation and termination sequences. Structural elements 

25 intended for use in yeast e^qnession systems preferably include a leader sequence enablm^ 
extracellular secretion of translated protein by a host cell. Alternatively, where recombinant 
protein is expressed without a leader or transport sequence, it may include an N-terminal 
metiiionine residue. This residue may optionally be subsequently cleaved from die 
e^qnessed recombinant protein to provide a final product 

30 "Recombinant microbial expression system" means a substantially homogeneous 

monoculture of suitable host microorganisms, for example, bacteria such as £. coli or yeast 
such as S. cerevisiae, which have stably integrated a recombinant transcriptional unit into 
chromosomal DNA or carry die recombinant transcriptional unit as a component of a 
resident plasnud. Generally, cells constimting die system are die progeny of a single 

35 ancestral transformant Recombinant expression systems as defined herein will express 
heterologous protein upon induction of die regulatory elements linked to die DNA sequence 
or synthetic gene to be expressed. 

The term "isolated", as used in the context of this specification to define the purity
of a G-CSFR or sG-CSFR protein or protein composition, means that the protein or
protein composition is substantially free of other proteins of natural or endogenous origin
and contains less than about 1% by mass of protein contaminants residual of production
processes. Such compositions, however, can contain other proteins added as stabilizers,
carriers, excipients or co-therapeutics. G-CSFR or sG-CSFR is isolated if it is detectable
as a single protein band in a polyacrylamide gel by silver staining.

Isolation of cDNAs Encoding G-CSFR

The coding sequence of a mammalian G-CSFR is obtained by first isolating a
cDNA sequence encoding G-CSFR from a recombinant DNA library generated using either
genomic DNA or cDNA. The preferred method for constructing a cDNA library is to
prepare polyadenylated mRNA obtained from a particular cell line which expresses a
mammalian G-CSFR and to convert the polyadenylated RNA to cDNA by reverse
transcription. A particularly preferred cellular source of mRNA for construction of the
cDNA library is human placental RNA.

A cDNA library will contain G-CSFR sequences which can be readily identified by
screening the library with an appropriate nucleic acid probe which is capable of hybridizing
with G-CSFR cDNA. Such probes can be derived from the nucleotide sequences disclosed
herein. Alternatively, DNAs encoding G-CSFR proteins can also be assembled by ligation
of synthetic oligonucleotide subunits to provide a complete coding sequence.

The cDNAs encoding G-CSFR of the present invention were isolated by the
method of direct expression cloning. Specifically, a cDNA library was constructed by first
isolating cytoplasmic mRNA from human placental tissue using standard techniques.
Polyadenylated mRNA was isolated and used to prepare double-stranded cDNA. Purified
cDNA fragments were then ligated into psfCAV vector DNA described in detail below in
Example 2. The psfCAV vectors containing the G-CSFR cDNA fragments were
transformed into E. coli strain DH5α. Transformants were plated to provide approximately
800 colonies per plate. The resulting colonies were harvested and each pool used to
prepare plasmid DNA for transfection into COS-7 cells essentially as described by Cosman
et al. (Nature 312:768, 1984) and Luthman et al. (Nucl. Acids Res. 11:1295, 1983).
Transformants expressing biologically active cell surface G-CSF receptors were identified
by screening for the ability of G-CSFR to bind 125I-G-CSF (5 x 10^-10 M). Specifically,
transfected COS-7 cells were incubated with medium containing 125I-G-CSF, the cells
washed to remove unbound labeled G-CSF, and the cell monolayers contacted with X-ray
film to detect concentrations of G-CSF binding, as disclosed by Sims et al., Science
241:585 (1988). Transfectants detected in this manner appear as dark foci against a
relatively light background.

This approach was used to screen approximately 30,000 cDNAs in pools of
approximately 600 cDNAs until assay of a transfectant pool indicated positive foci for
G-CSF binding. A frozen stock of bacteria from this positive pool was grown in culture and
plated to provide individual colonies, which were screened until single clones were
identified which are capable of directing synthesis of a surface protein with detectable
G-CSF binding activity. Additional cDNA clones can be isolated from cDNA libraries of
other mammalian species by cross-species hybridization of human G-CSFR cDNAs with
cDNA derived from other mammalian species. For use in hybridization, DNA encoding
G-CSFR may be covalently labeled with a detectable substance such as a fluorescent group, a
radioactive atom or a chemiluminescent group by methods well known to those skilled in
the art. Such probes could also be used for in vitro diagnosis of particular conditions.

Like most mammalian genes, mammalian G-CSF receptors are presumably
encoded by multi-exon genes. Alternative mRNA constructs which can be attributed to
different mRNA splicing events following transcription, and which share large regions of
identity or similarity with the cDNAs claimed herein, are considered to be within the scope
of the present invention.

Proteins and Analogs

The present invention provides isolated recombinant mammalian G-CSFR
polypeptides as defined above. Isolated G-CSFR polypeptides are substantially free of
other contaminating materials of natural or endogenous origin and contain less than about
1% by mass of protein contaminants residual of production processes. Such polypeptides
are optionally without associated native-pattern glycosylation. Mammalian G-CSFR of the
present invention includes, by way of example, primate, human, murine, canine, feline,
bovine, ovine, equine and porcine G-CSFR. Derivatives of G-CSFR within the scope of
the invention also include various structural forms of the primary protein which retain
biological activity. Due to the presence of ionizable amino and carboxyl groups, for
example, a G-CSFR protein may be in the form of acidic or basic salts, or may be in
neutral form. Individual amino acid residues may also be modified by oxidation or
reduction.

The primary amino acid structure may be modified by forming covalent or
aggregative conjugates with other chemical moieties, such as glycosyl groups, lipids,
phosphate, acetyl groups and the like, or by creating amino acid sequence mutants.
Covalent derivatives are prepared by linking particular functional groups to G-CSFR amino
acid side chains or at the N- or C-termini. Other derivatives of G-CSFR within the scope
of this invention include covalent or aggregative conjugates of G-CSFR or its fragments
with other proteins or polypeptides, such as by synthesis in recombinant culture as
N-terminal or C-terminal fusions. For example, the conjugated peptide may be a signal (or
leader) polypeptide sequence at the N-terminal region of the protein which
co-translationally or post-translationally directs transfer of the protein from its site of synthesis
to its site of function inside or outside of the cell membrane or wall (e.g., the yeast α-factor
leader). G-CSFR protein fusions can comprise peptides added to facilitate purification or
identification of G-CSFR (e.g., poly-His). The amino acid sequence of G-CSF receptor
can also be linked to the peptide Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys (DYKDDDDK)
(Hopp et al., Bio/Technology 6:1204, 1988). The latter sequence is highly antigenic and
provides an epitope reversibly bound by a specific monoclonal antibody, enabling rapid
assay and facile purification of expressed recombinant protein. This sequence is also
specifically cleaved by bovine mucosal enterokinase at the residue immediately following
the Asp-Lys pairing. Fusion proteins capped with this peptide may also be resistant to
intracellular degradation in E. coli.
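
As a minimal sketch of the enterokinase cleavage described above, the following Python
function splits an N-terminal DYKDDDDK fusion immediately after the Asp-Lys pair that
ends the peptide. The function name and the downstream residues in the example are
hypothetical; only the octapeptide itself is taken from the text.

```python
FLAG = "DYKDDDDK"  # Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys, as given in the text


def cleave_flag(fusion: str) -> tuple[str, str]:
    """Split an N-terminal fusion right after the Asp-Lys pair ending the peptide."""
    if not fusion.startswith(FLAG):
        raise ValueError("sequence does not begin with the DYKDDDDK peptide")
    # Cleavage occurs at the residue immediately following the Asp-Lys pairing.
    return fusion[:len(FLAG)], fusion[len(FLAG):]
```

For a hypothetical fusion DYKDDDDKEAGN, the function returns the leader DYKDDDDK and the
released fragment EAGN.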

G-CSFR derivatives may also be used as immunogens, reagents in receptor-based
immunoassays, or as binding agents for affinity purification procedures of G-CSF or other
binding ligands. G-CSFR derivatives may also be obtained using cross-linking agents, such
as M-maleimidobenzoyl succinimide ester and N-hydroxysuccinimide, at cysteine and
lysine residues. G-CSFR proteins may also be covalently bound through reactive side
groups to various insoluble substrates, such as cyanogen bromide-activated,
bisoxirane-activated, carbonyldiimidazole-activated or tosyl-activated agarose structures, or by
adsorbing to polyolefin surfaces (with or without glutaraldehyde cross-linking). Once
bound to a substrate, G-CSFR may be used to selectively bind (for purposes of assay or
purification) anti-G-CSFR antibodies or G-CSF.

The present invention also includes G-CSFR with or without associated
native-pattern glycosylation. G-CSFR expressed in yeast or mammalian expression systems,
e.g., COS-7 cells, may be similar to or slightly different from the native molecules in
molecular weight and glycosylation pattern, depending upon the expression system.
Expression of G-CSFR DNAs in bacteria such as E. coli provides non-glycosylated
molecules. Functional mutant analogs of mammalian G-CSFR having inactivated
N-glycosylation sites can be produced by oligonucleotide synthesis and ligation or by
site-specific mutagenesis techniques. These analog proteins can be produced in a
homogeneous, reduced-carbohydrate form in good yield using yeast expression systems.
N-glycosylation sites in eukaryotic proteins are characterized by the amino acid triplet
Asn-A1-Z, where A1 is any amino acid except Pro, and Z is Ser or Thr. In this sequence,
asparagine provides a side chain amino group for covalent attachment of carbohydrate.
Such a site can be eliminated by substituting another amino acid for Asn or for residue Z,
deleting Asn or Z, or inserting a non-Z amino acid between A1 and Z, or an amino acid
other than Asn between Asn and A1.
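
The Asn-A1-Z rule lends itself to a simple sequence scan. The Python sketch below, given
for illustration only, locates such triplets in a one-letter protein sequence and
eliminates one site by substituting another residue for Asn. The function names, the
example sequence, and the choice of Gln as the replacement are assumptions, not part of
the disclosure.

```python
import re

# Asn-A1-Z: A1 is any residue except Pro, Z is Ser or Thr (one-letter codes).
# The lookahead lets overlapping triplets be reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def n_glycosylation_sites(protein: str) -> list[int]:
    """Return the 1-based position of the Asn in each Asn-A1-Z triplet."""
    return [m.start() + 1 for m in SEQUON.finditer(protein)]


def remove_site(protein: str, asn_position: int, replacement: str = "Q") -> str:
    """Eliminate one site by substituting another amino acid for the Asn."""
    i = asn_position - 1
    if protein[i] != "N":
        raise ValueError("no Asn at that position")
    return protein[:i] + replacement + protein[i + 1:]
```

In the hypothetical sequence MNGSANPTLNNTS, the triplets begin at positions 2, 10 and
11; Asn-Pro-Thr at position 6 is not a site because A1 is Pro.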

G-CSFR derivatives may also be obtained by mutations of G-CSFR or its subunits.
A G-CSFR mutant, as referred to herein, is a polypeptide homologous to G-CSFR but
which has an amino acid sequence different from native G-CSFR because of a deletion,
insertion or substitution.

Bioequivalent analogs of G-CSFR proteins may be constructed by, for example,
making various substitutions of residues or sequences or deleting terminal or internal
residues or sequences not needed for biological activity. For example, aliphatic amino acid
residues, such as Ile, Val, Leu or Ala may be substituted for one another, or polar amino
acid residues, such as Lys and Arg, Glu and Asp, or Gln and Asn, may be substituted for
one another. Also, cysteine residues can be deleted or replaced with other amino acids to
prevent formation of incorrect intramolecular disulfide bridges upon renaturation. Other
approaches to mutagenesis involve modification of adjacent dibasic amino acid residues to
enhance expression in yeast systems in which KEX2 protease activity is present.
Generally, substitutions should be made conservatively; i.e., the most preferred substitute
amino acids are those having physicochemical characteristics resembling those of the
residue to be replaced. Similarly, when a deletion or insertion strategy is adopted, the
potential effect of the deletion or insertion on biological activity should be considered.
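
The substitution groups named above can be written down as a small lookup, shown here
as an illustrative Python sketch. The grouping covers only the residues listed in the text
and is not a complete classification of amino acids; the function name is hypothetical.

```python
# Groups named in the text (one-letter codes): aliphatic Ile/Val/Leu/Ala,
# and the polar pairs Lys/Arg, Glu/Asp, Gln/Asn.
CONSERVATIVE_GROUPS = [set("IVLA"), set("KR"), set("ED"), set("QN")]


def is_conservative(original: str, substitute: str) -> bool:
    """True if the two residues differ and share one of the groups above."""
    return original != substitute and any(
        original in group and substitute in group for group in CONSERVATIVE_GROUPS
    )
```

Under this sketch, Leu to Val and Lys to Arg count as conservative, while Lys to Asp does
not.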

Subunits of G-CSFR may be constructed by deleting terminal or internal residues
or sequences. Particularly preferred subunits include those in which the transmembrane
region and intracellular domain of G-CSFR are deleted or substituted with hydrophilic
residues to facilitate secretion of the receptor into the cell culture medium. The resulting
protein is a soluble truncated G-CSFR molecule which may retain its ability to bind
G-CSF.
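
The truncation can be pictured in terms of the residue numbering given for clone D-7.
The Python sketch below is illustrative only: the extracellular (1-603) and transmembrane
(604-629) coordinates come from the text, the cytoplasmic range 630-759 is inferred from
the stated mature length of 759 residues, and the function names are hypothetical.

```python
# Mature-protein coordinates for clone D-7 (Figures 2-5), 1-based and inclusive.
REGIONS = {
    "extracellular": (1, 603),
    "transmembrane": (604, 629),
    "cytoplasmic": (630, 759),  # inferred: remainder of the 759-residue protein
}


def region_of(residue: int) -> str:
    """Name the region that contains a given mature-protein residue number."""
    for name, (start, end) in REGIONS.items():
        if start <= residue <= end:
            return name
    raise ValueError("residue number outside the mature protein")


def soluble_fragment(mature: str) -> str:
    """Keep only the extracellular residues 1-603 of a mature sequence."""
    return mature[: REGIONS["extracellular"][1]]
```

Applied to any 759-residue mature sequence, soluble_fragment returns its first 603
residues, corresponding to a receptor lacking the transmembrane and cytoplasmic regions.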

Mutations in nucleotide sequences constructed for expression of analog G-CSFR
must, of course, preserve the reading frame phase of the coding sequences and preferably
will not create complementary regions that could hybridize to produce secondary mRNA
structures such as loops or hairpins which would adversely affect translation of the receptor
mRNA. Although a mutation site may be predetermined, it is not necessary that the nature
of the mutation per se be predetermined. For example, in order to select for optimum
characteristics of mutants at a given site, random mutagenesis may be conducted at the
target codon and the expressed G-CSFR mutants screened for the desired activity.
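
The reading-frame requirement amounts to a length check: an insertion or deletion
leaves the downstream frame intact only when the net change is a multiple of three
nucleotides. A minimal Python sketch follows; the function name is hypothetical.

```python
def preserves_reading_frame(inserted: int = 0, deleted: int = 0) -> bool:
    """True if the net change in nucleotide count is a multiple of three."""
    return (inserted - deleted) % 3 == 0
```

Deleting one codon (three nucleotides) preserves the frame, whereas inserting a single
nucleotide shifts every codon downstream of the mutation.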

Not all mutations in the nucleotide sequence which encodes G-CSFR will be
expressed in the final product; for example, nucleotide substitutions may be made to
enhance expression, primarily to avoid secondary structure loops in the transcribed mRNA
(see EPA 75,444A, incorporated herein by reference), or to provide codons that are more
readily translated by the selected host, e.g., the well-known E. coli preference codons for
E. coli expression.
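
Because the genetic code is degenerate, different nucleotide sequences can encode the
same polypeptide, which is why such substitutions need not alter the final product. The
Python sketch below translates two hypothetical coding sequences for the DYKDDDDK
peptide mentioned earlier using the standard genetic code; the two DNA sequences are
examples chosen for illustration and are not taken from the figures.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence; a trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))
```

The sequences GATTATAAAGATGATGATGATAAA and GACTACAAGGACGACGACGACAAG differ at
every third position yet both translate to DYKDDDDK.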

Mutations can be introduced at particular loci by synthesizing oligonucleotides
containing a mutant sequence, flanked by restriction sites enabling ligation to fragments of
the native sequence. Following ligation, the resulting reconstructed sequence encodes an
analog having the desired amino acid insertion, substitution, or deletion.

Alternatively, oligonucleotide-directed site-specific mutagenesis procedures can be
employed to provide an altered gene having particular codons altered according to the
substitution, deletion, or insertion required. Exemplary methods of making the alterations
set forth above are disclosed by Walder et al. (Gene 42:133, 1986); Bauer et al. (Gene
37:73, 1985); Craik (BioTechniques, January 1985, 12-19); and Smith et al. (Genetic
Engineering: Principles and Methods, Plenum Press, 1981). U.S. Patent Nos.
4,518,584 and 4,737,462 disclose suitable techniques, and are incorporated by reference
herein.

Expression of Recombinant G-CSFR

The present invention provides recombinant expression vectors which include
synthetic or cDNA-derived DNA fragments encoding mammalian G-CSFR or
bioequivalent analogs operably linked to suitable transcriptional or translational regulatory
elements derived from mammalian, microbial, viral or insect genes. Such regulatory
elements include a transcriptional promoter, an optional operator sequence to control
transcription, a sequence encoding suitable mRNA ribosomal binding sites, and sequences
which control the termination of transcription and translation, as described in detail below.
The ability to replicate in a host, usually conferred by an origin of replication, and a
selection gene to facilitate recognition of transformants may additionally be incorporated.
DNA regions are operably linked when they are functionally related to each other. For
example, DNA for a signal peptide (secretory leader) is operably linked to DNA for a
polypeptide if it is expressed as a precursor which participates in the secretion of the
polypeptide; a promoter is operably linked to a coding sequence if it controls the
transcription of the sequence; or a ribosome binding site is operably linked to a coding
sequence if it is positioned so as to permit translation. Generally, operably linked means
contiguous and, in the case of secretory leaders, contiguous and in reading frame.
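
The reading-frame requirement for a secretory leader can be sketched as a simple check. This is a minimal illustration only: the leader and coding sequences below are hypothetical, and the function names are not part of this disclosure.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def split_codons(seq):
    """Split a sequence into complete codons, ignoring any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def leader_in_frame(leader, coding):
    """Return True if a leader fused directly upstream of a coding segment
    keeps that segment in the reading frame set by the leader's ATG."""
    if not leader.startswith("ATG"):
        return False
    if len(leader) % 3 != 0:
        # A leader whose length is not a multiple of three shifts the frame.
        return False
    fused = split_codons(leader + coding)
    # No stop codon may appear before the final codon of the fusion.
    return not any(codon in STOP_CODONS for codon in fused[:-1])


LEADER = "ATGAAAGCTCTG"          # hypothetical four-codon leader
CODING = "GCAAGGCTGGGAAACTGA"    # hypothetical coding segment ending in a stop codon

if __name__ == "__main__":
    print(leader_in_frame(LEADER, CODING))        # contiguous and in frame
    print(leader_in_frame(LEADER + "C", CODING))  # one extra base shifts the frame
```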

DNA sequences encoding mammalian G-CSF receptors which are to be expressed
in a microorganism will preferably contain no introns that could prematurely terminate
transcription of DNA into mRNA; however, premature termination of transcription may be
desirable, for example, where it would result in mutants having advantageous C-terminal



WO 91/05046    PCT/US90/05434

(Bolivar et al., Gene 2:95, 1977). pBR322 contains genes for ampicillin and tetracycline
resistance and thus provides simple means for identifying transformed cells.

Promoters commonly used in recombinant microbial expression vectors include the
β-lactamase (penicillinase) and lactose promoter system (Chang et al., Nature 275:615,
1978; and Goeddel et al., Nature 281:544, 1979), the tryptophan (trp) promoter system
(Goeddel et al., Nucl. Acids Res. 8:4057, 1980; and EPA 36,776) and tac promoter
(Maniatis, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, p.
412, 1982). A particularly useful bacterial expression system employs the phage λ PL
promoter and cI857ts thermolabile repressor. Plasmid vectors available from the American
Type Culture Collection which incorporate derivatives of the λ PL promoter include
plasmid pHUB2, resident in E. coli strain JMB9 (ATCC 37092) and pPLc28, resident in
E. coli RR1 (ATCC 53082).

Recombinant G-CSFR proteins may also be expressed in yeast hosts, preferably
from the Saccharomyces species, such as S. cerevisiae. Yeast of other genera, such as
Pichia or Kluyveromyces may also be employed. Yeast vectors will generally contain an
origin of replication from the 2μ yeast plasmid or an autonomously replicating sequence
(ARS), promoter, DNA encoding G-CSFR, sequences for polyadenylation and
transcription termination and a selection gene. Preferably, yeast vectors will include an
origin of replication and selectable marker permitting transformation of both yeast and E.
coli, e.g., the ampicillin resistance gene of E. coli and S. cerevisiae trp1 gene, which
provides a selection marker for a mutant strain of yeast lacking the ability to grow in
the absence of tryptophan, and a promoter derived from a highly expressed yeast gene to induce
transcription of a structural sequence downstream. The presence of the trp1 lesion in the
yeast host cell genome then provides an effective environment for detecting transformation
by growth in the absence of tryptophan.

Suitable promoter sequences in yeast vectors include the promoters for
metallothionein, 3-phosphoglycerate kinase (Hitzeman et al., J. Biol. Chem. 255:2073,
1980) or other glycolytic enzymes (Hess et al., J. Adv. Enzyme Reg. 7:149, 1968; and
Holland et al., Biochem. 17:4900, 1978), such as enolase, glyceraldehyde-3-phosphate
dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-
phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate
isomerase, phosphoglucose isomerase, and glucokinase. Suitable vectors and promoters
for use in yeast expression are further described in R. Hitzeman et al., EPA 73,657.

Preferred yeast vectors can be assembled using DNA sequences from pBR322 for
selection and replication in E. coli (Ampr gene and origin of replication) and yeast DNA
sequences including a glucose-repressible ADH2 promoter and α-factor secretion leader.
The ADH2 promoter has been described by Russell et al. (J. Biol. Chem. 258:2674, 1982)
and Beier et al. (Nature 300:724, 1982). The yeast α-factor leader, which directs
secretion of heterologous proteins, can be inserted between the promoter and the structural
gene to be expressed. See, e.g., Kurjan et al., Cell 30:933, 1982; and Bitter et al., Proc.
Natl. Acad. Sci. USA 81:5330, 1984. The leader sequence may be modified to contain,
near its 3' end, one or more useful restriction sites to facilitate fusion of the leader sequence
to foreign genes.

Suitable yeast transformation protocols are known to those of skill in the art; an
exemplary technique is described by Hinnen et al., Proc. Natl. Acad. Sci. USA 75:1929,
1978, selecting for Trp+ transformants in a selective medium consisting of 0.67% yeast
nitrogen base, 0.5% casamino acids, 2% glucose, 10 μg/ml adenine and 20 μg/ml uracil.

Host strains transformed by vectors comprising the ADH2 promoter may be grown
for expression in a rich medium consisting of 1% yeast extract, 2% peptone, and 1%
glucose supplemented with 80 μg/ml adenine and 80 μg/ml uracil. Derepression of the
ADH2 promoter occurs upon exhaustion of medium glucose. Crude yeast supernatants are
harvested by filtration and held at 4°C prior to further purification.
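
The two media compositions given above can be converted into batch quantities with a short calculation. The sketch below assumes that the percentages are weight per volume (grams per 100 ml), which the text does not state explicitly; the dictionary layout and function name are likewise illustrative assumptions.

```python
# Compositions as stated in the text; percentages are ASSUMED to be % (w/v).
SELECTIVE_MEDIUM = {
    "percent_wv": {"yeast nitrogen base": 0.67, "casamino acids": 0.5, "glucose": 2.0},
    "ug_per_ml": {"adenine": 10.0, "uracil": 20.0},
}
RICH_MEDIUM = {
    "percent_wv": {"yeast extract": 1.0, "peptone": 2.0, "glucose": 1.0},
    "ug_per_ml": {"adenine": 80.0, "uracil": 80.0},
}


def batch_amounts(medium, volume_ml):
    """Return {component: (amount, unit)} needed for a batch of the given volume."""
    amounts = {}
    for name, percent in medium["percent_wv"].items():
        # % (w/v) means grams per 100 ml.
        amounts[name] = (round(percent * volume_ml / 100.0, 3), "g")
    for name, conc in medium["ug_per_ml"].items():
        # micrograms per ml times ml gives micrograms; divide by 1000 for mg.
        amounts[name] = (round(conc * volume_ml / 1000.0, 3), "mg")
    return amounts


if __name__ == "__main__":
    for label, medium in (("selective", SELECTIVE_MEDIUM), ("rich", RICH_MEDIUM)):
        print(label, batch_amounts(medium, 1000))
```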

Various mammalian or insect cell culture systems can be employed to express
recombinant protein. Baculovirus systems for production of heterologous proteins in
insect cells are reviewed by Luckow and Summers, Bio/Technology 6:47 (1988).
Examples of suitable mammalian host cell lines include the COS-7 lines of monkey kidney
cells, described by Gluzman (Cell 23:175, 1981), and other cell lines capable of expressing
an appropriate vector including, for example, L cells, C127, 3T3, Chinese hamster ovary
(CHO), HeLa and BHK cell lines. Mammalian expression vectors may comprise
nontranscribed elements such as an origin of replication, a suitable promoter and enhancer
linked to the gene to be expressed, and other 5' or 3' flanking nontranscribed sequences,
and 5' or 3' nontranslated sequences, such as necessary ribosome binding sites, a
polyadenylation site, splice donor and acceptor sites, and transcriptional termination
sequences.

The transcriptional and translational control sequences in expression vectors to be
used in transforming vertebrate cells may be provided by viral sources. For example,
commonly used promoters and enhancers are derived from Polyoma, Adenovirus 2,
Simian Virus 40 (SV40), and human cytomegalovirus. DNA sequences derived from the
SV40 viral genome, for example, SV40 origin, early and late promoter, enhancer, splice,
and polyadenylation sites may be used to provide the other genetic elements required for
expression of a heterologous DNA sequence. The early and late promoters are particularly
useful because both are obtained easily from the virus as a fragment which also contains the
SV40 viral origin of replication (Fiers et al., Nature 273:113, 1978). Smaller or larger
SV40 fragments may also be used, provided the approximately 250 bp sequence extending
from the Hind III site toward the BglI site located in the viral origin of replication is
included. Further, mammalian genomic G-CSFR promoter, control and/or signal
sequences may be utilized, provided such control sequences are compatible with the host
cell chosen. Additional details regarding the use of a mammalian high expression vector to
produce a recombinant mammalian G-CSF receptor are provided in Example 2 below.
Exemplary vectors can be constructed as disclosed by Okayama and Berg (Mol. Cell. Biol.
3:280, 1983).

A useful system for stable high level expression of mammalian receptor cDNAs in
C127 murine mammary epithelial cells can be constructed substantially as described by
Cosman et al. (Mol. Immunol. 23:935, 1986).

A particularly preferred eukaryotic vector for expression of G-CSFR DNA is
disclosed below in Example 2. This vector, referred to as pCAV/NOT, was derived from
the mammalian high expression vector pDC201 and contains regulatory sequences from
SV40, adenovirus-2, and human cytomegalovirus.

Purified mammalian G-CSF receptors or analogs are prepared by culturing suitable
host/vector systems to express the recombinant translation products of the DNAs of the
present invention, which are then purified from culture media or cell extracts.

For example, supernatants from systems which secrete recombinant protein into
culture media can be first concentrated using a commercially available protein concentration
filter, for example, an Amicon or Millipore Pellicon ultrafiltration unit. Following the
concentration step, the concentrate can be applied to a suitable purification matrix. For
example, a suitable affinity matrix can comprise a G-CSF or lectin or antibody molecule
bound to a suitable support. Alternatively, an anion exchange resin can be employed, for
example, a matrix or substrate having pendant diethylaminoethyl (DEAE) groups. The
matrices can be acrylamide, agarose, dextran, cellulose or other types commonly employed
in protein purification. Alternatively, a cation exchange step can be employed. Suitable
cation exchangers include various insoluble matrices comprising sulfopropyl or
carboxymethyl groups. Sulfopropyl groups are preferred.

Finally, one or more reversed-phase high performance liquid chromatography (RP-
HPLC) steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant
methyl or other aliphatic groups, can be employed to further purify a G-CSFR
composition. Some or all of the foregoing purification steps, in various combinations, can
also be employed to provide a homogeneous recombinant protein.

Recombinant protein produced in bacterial culture is usually isolated by initial
extraction from cell pellets, followed by one or more concentration, salting-out, aqueous
ion exchange or size exclusion chromatography steps. Finally, high performance liquid
chromatography (HPLC) can be employed for final purification steps. Microbial cells
employed in expression of recombinant mammalian G-CSFR can be disrupted by any
convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or
use of cell lysing agents.

Fermentation of yeast which express mammalian G-CSFR as a secreted protein
greatly simplifies purification. Secreted recombinant protein resulting from a large-scale
fermentation can be purified by methods analogous to those disclosed by Urdal et al. (J.
Chromatog. 296:171, 1984). This reference describes two sequential, reversed-phase
HPLC steps for purification of recombinant human GM-CSF on a preparative HPLC
column.

Human G-CSFR synthesized in recombinant culture is characterized by the
presence of non-human cell components, including proteins, in amounts and of a character
which depend upon the purification steps taken to recover human G-CSFR from the
culture. These components ordinarily will be of yeast, prokaryotic or non-human higher
eukaryotic origin and preferably are present in innocuous contaminant quantities, on the
order of less than about 1 percent by weight. Further, recombinant cell culture enables the
production of G-CSFR free of proteins which may be normally associated with G-CSFR
as it is found in nature in its species of origin, e.g. in cells, cell exudates or body fluids.

G-CSFR compositions are prepared for administration by mixing G-CSFR having
the desired degree of purity with physiologically acceptable carriers. Such carriers will be
nontoxic to recipients at the dosages and concentrations employed. Ordinarily, the
preparation of such compositions entails combining the G-CSFR with buffers, antioxidants
such as ascorbic acid, low molecular weight (less than about 10 residues) polypeptides,
proteins, amino acids, carbohydrates including glucose, sucrose or dextrins, chelating
agents such as EDTA, glutathione and other stabilizers and excipients.

G-CSFR compositions may be used to attenuate G-CSF-mediated immune
responses. To achieve this result, a therapeutically effective quantity of a G-CSFR
composition is administered to a mammal, preferably a human, in association with a
pharmaceutical carrier or diluent.

The following examples are offered by way of illustration, and not by way of
limitation.

EXAMPLES 
Example 1 

Binding Assays

A. Radiolabeling of G-CSF. Recombinant human G-CSF, in the form of a fusion
protein containing a hydrophilic octapeptide at the N-terminus, was expressed in yeast as a
secreted protein and purified by affinity chromatography as described by Hopp et al.,
Bio/Technology 6:1204, 1988. The protein was radiolabeled using the commercially
available solid phase agent, IODO-GEN (Pierce). In this procedure, 5 μg of IODO-GEN
were plated at the bottom of a 10 x 75 mm glass tube and incubated for 20 minutes at 4°C
with 75 μl of 0.1 M sodium phosphate, pH 7.4 and 20 μl (2 mCi) Na 125I. This solution
was then transferred to a second glass tube containing 5 μg G-CSF in 45 μl PBS for 20
minutes at 4°C. The reaction mixture was fractionated by gel filtration on a 2 ml bed
volume of Sephadex G-25 (Sigma) equilibrated in Roswell Park Memorial Institute (RPMI)
1640 medium containing 2.5% (w/v) bovine serum albumin (BSA), 0.2% (w/v) sodium
azide and 20 mM Hepes pH 7.4 (binding medium). The final pool of 125I-G-CSF was
diluted to a working stock solution of 1 x 10-'^ M in binding medium and stored for up to
one month at 4°C without detectable loss of receptor binding activity. The specific activity
is routinely 1 x 10^6 cpm/mmole G-CSF. Radiolabeled G-CSF is used as described below
to assay for G-CSF receptors.

B. Membrane Binding Assays. Human placental membranes were incubated at
4°C for 2 hr with 125I-G-CSF in binding medium, 0.1% bacitracin, 0.02% aprotinin, and
0.4% BSA in a total volume of 1.2 ml. Control tubes containing in addition a 100 x molar
excess of unlabeled G-CSF were also included to determine non-specific binding. The
reaction mixture was then centrifuged at 15,000 x g in a microfuge for 5 minutes.
Supernatants were discarded, the surface of the membrane pellets carefully rinsed with ice-
cold binding medium, and the radioactivity counted on a gamma counter. Using this assay,
it was determined that the G-CSFR present in the COS cell supernatants of Example 2 had
a Ka of about 1 x 10^ M-1 and a molecular weight of about 35 kDa.
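
The role of the control tubes can be made concrete with a small calculation: specific binding is the total bound radioactivity minus the radioactivity bound in the presence of excess unlabeled G-CSF. The following sketch uses invented replicate counts purely for illustration; no such readings appear in this disclosure, and the function name is hypothetical.

```python
from statistics import mean


def specific_binding(total_cpm, nonspecific_cpm):
    """Mean total counts minus mean counts measured with excess unlabeled G-CSF."""
    return mean(total_cpm) - mean(nonspecific_cpm)


# Hypothetical gamma-counter readings (cpm), invented for illustration only.
total = [5200, 5100, 5300]        # tubes with 125I-G-CSF alone
nonspecific = [400, 420, 380]     # control tubes with 100 x unlabeled G-CSF

specific = specific_binding(total, nonspecific)
fraction_specific = specific / mean(total)

if __name__ == "__main__":
    print(f"specific binding: {specific} cpm")
    print(f"fraction of total that is specific: {fraction_specific:.3f}")
```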

C. Solid Phase Binding Assays. The ability of G-CSFR to be stably adsorbed to
nitrocellulose from detergent extracts of human cells yet retain G-CSF-binding activity
provided a means of detecting G-CSFR. Cell extracts were prepared by mixing a cell
pellet with a 2X volume of PBS containing 1% Triton X-100 and a cocktail of protease
inhibitors (2 mM phenylmethyl sulfonyl fluoride, 10 μM pepstatin, 10 μM leupeptin, 2 mM
o-phenanthroline and 2 mM EGTA) by vigorous vortexing. The mixture was incubated on
ice for 30 minutes after which it was centrifuged at 12,000 x g for 15 minutes at 8°C to
remove nuclei and other debris. Two microliter aliquots of cell extracts were placed on dry
BA85/21 nitrocellulose membranes (Schleicher and Schuell, Keene, NH) and allowed to
dry. The membranes were incubated in tissue culture dishes for 30 minutes in Tris (0.05
M) buffered saline (0.15 M) pH 7.5 containing 3% w/v BSA to block nonspecific binding
sites. The membrane was then covered with 0.3 nM 125I-G-CSF in PBS + 3% BSA and
incubated for 2 hr at 4°C with shaking. At the end of this time, the membranes were
washed 3 times in PBS, dried and placed on Kodak X-Omat AR film for 18 hr at -70°C.
This assay was performed to detect the presence of G-CSFR in various cell lines and
tissue sources.

D. Binding Assay for Soluble G-CSFR. Soluble G-CSFR present in COS-7 cell
supernatants is measured by inhibition of 125I-G-CSF binding to a G-CSF-dependent cell
line, or any other human cell or cell line expressing G-CSF receptors, such as human
placental cells. Supernatants are harvested from COS-7 cells 3 days after transfection,
concentrated 10-fold, and preincubated with 125I-G-CSF for 1 hour at 37°C. Appropriate
G-CSF-receptor-bearing cells are added to a final volume of 150 μl, incubated for an
additional 30 minutes at 37°C and assayed and analyzed as described by Park et al., J.
Biol. Chem. 261:4177 (1986).

Example 2 

Isolation of Human G-CSFR cDNAs by Direct Expression of Active Protein in COS-7 Cells

A tissue source for G-CSFR was selected by screening various human cell lines
and tissues for expression of G-CSFR based on their ability to bind 125I-labeled G-CSF,
prepared as described above in Example 1A. Human placental membranes were found to
express a reasonable number of receptors. Equilibrium binding studies were performed
according to Example 1B and showed that the membranes exhibited biphasic binding of
125I-G-CSF with high affinity sites (Ka = 4 x 10^9 M-1) of 0.4 pmoles receptor/mg
protein.
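
To put the reported binding parameters in more familiar terms, the association constant can be inverted to a dissociation constant and the receptor density converted to sites per milligram of protein. The sketch below treats the high affinity class as a single independent site, which is a simplifying assumption given that the binding is described as biphasic; the function names are illustrative.

```python
AVOGADRO = 6.02214076e23  # molecules per mole


def dissociation_constant(ka_per_molar):
    """Kd (M) is the reciprocal of Ka (M^-1)."""
    return 1.0 / ka_per_molar


def fractional_occupancy(ka_per_molar, ligand_molar):
    """Fraction of sites occupied at equilibrium for a single-site model."""
    x = ka_per_molar * ligand_molar
    return x / (1.0 + x)


def sites_per_mg(pmol_per_mg):
    """Convert pmoles of receptor per mg protein into binding sites per mg."""
    return pmol_per_mg * 1e-12 * AVOGADRO


KA_HIGH = 4e9                              # M^-1, as reported above
KD_HIGH = dissociation_constant(KA_HIGH)   # about 2.5e-10 M, i.e. 0.25 nM
SITES = sites_per_mg(0.4)                  # about 2.4e11 sites per mg protein

if __name__ == "__main__":
    print(f"Kd = {KD_HIGH:.2e} M")
    print(f"occupancy at [G-CSF] = Kd: {fractional_occupancy(KA_HIGH, KD_HIGH):.2f}")
    print(f"sites per mg protein = {SITES:.2e}")
```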

An unsized cDNA library was constructed by reverse transcription of 
polyadenylated mRNA isolated from total RNA extracted from the human placental tissue
(Ausubel et al., eds., Current Protocols in Molecular Biology, Vol. 1, 1987). The cells
were harvested by lysing the tissue cells in a guanidinium isothiocyanate solution and total 
RNA was isolated using standard techniques as described by Maniatis, Molecular Cloning: 
A Laboratory Manual, Cold Spring Harbor Laboratory, 1982. 

Polyadenylated RNA was isolated by oligo dT cellulose chromatography and
double-stranded cDNA was prepared by a method similar to that of Gubler and
Hoffman, Gene 25:263, 1983. Briefly, the polyadenylated RNA was converted to an
RNA-cDNA hybrid with reverse transcriptase using oligo dT as a primer. The RNA-
cDNA hybrid was then converted into double-stranded cDNA using RNAase H in
combination with DNA polymerase I. The resulting double stranded cDNA was blunt-
ended with T4 DNA polymerase. BglII adaptors were ligated to the 5' ends of the resulting
blunt-ended cDNA as described by Haymerle et al., Nucleic Acids Research 14:8615,
1986. The non-ligated adaptors were removed by gel filtration chromatography at 68°C,
leaving 24 nucleotide non-self-complementary overhangs on the cDNA. The same
procedure was used to convert the 5' BglII ends of the mammalian expression vector
psfCAV to 24 nucleotide overhangs complementary to those added to the cDNA. Optimal
proportions of adaptored vector and cDNA were ligated in the presence of T4
polynucleotide kinase. Dialyzed ligation mixtures were electroporated into E. coli strain
DH5α and transformants selected on ampicillin plates.

The resulting cDNAs were ligated into the eukaryotic expression vector psfCAV,
which was designed to express cDNA sequences inserted at its multiple cloning site when
transfected into mammalian cells. psfCAV was assembled from pDC201 (a derivative of
pMLSV, previously described by Cosman et al., Nature 312:768, 1984), SV40 and
cytomegalovirus DNA and comprises, in sequence with the direction of transcription from
the origin of replication: (1) SV40 sequences from coordinates 5171-5270 containing the
origin of replication, enhancer sequences and early and late promoters; (2) cytomegalovirus
sequences containing the promoter and enhancer regions (nucleotides 671 to +63 from the
sequence published by Boshart et al. (Cell 41:521, 1985)); (3) adenovirus-2 sequences
from coordinates 5779-6079 containing sequences for the major late promoter and the first
exon of the tripartite leader (TPL), coordinates 7101-7172 and 9634-9693 containing the
second exon and part of the third exon of the TPL and a multiple cloning site (MCS)
containing sites for XhoI, KpnI, SmaI and BglII; (4) SV40 sequences from coordinates
4127-4100 and 2770-2533 containing the polyadenylation and termination signals for early
transcription; (5) adenovirus sequences from coordinates 10532-11156 of the virus-
associated RNA genes VAI and VAII of pDC201; and (6) pBR322 sequences from
coordinates 4363-2486 and 1094-375 containing the ampicillin resistance gene and origin
of replication.

The resulting human placental cDNA library in psfCAV was used to transform E.
coli strain DH5α, and recombinants were plated to provide approximately 500-600 colonies
per plate and sufficient plates to provide approximately 30,000 total colonies per screen.
Colonies were scraped from each plate, pooled, and plasmid DNA prepared from each
pool. The pooled DNA was then used to transfect a sub-confluent layer of monkey COS-7
cells using DEAE-dextran followed by chloroquine treatment, as described by Luthman et
al., Nucl. Acids Res. 11:1295 (1983) and McCutchan et al., J. Natl. Cancer Inst. 41:351
(1968). The cells were then grown in culture for three days to permit transient expression
of the inserted sequences. After three days, cell culture supernatants were discarded and
the cell monolayers in each plate assayed for G-CSF binding as follows. Three ml of
binding medium containing 1.2 x 10^-11 M 125I-labeled flag-G-CSF was added to each plate
and the plates incubated at 4°C for 120 minutes. This medium was then discarded, and
each plate was washed once with cold binding medium (containing no labeled G-CSF) and
twice with cold PBS. The edges of each plate were then broken off, leaving a flat disk
which was contacted with X-ray film for 72 hours at -70°C using an intensifying screen.
G-CSF binding activity was visualized on the exposed films as a dark spot against a
relatively uniform background.

After approximately 30,000 recombinants from the library had been screened in this
manner, nine transfectant pools were observed to provide G-CSF binding foci which were
clearly apparent against the background exposure.

A frozen stock of bacteria from the positive pool was then used to obtain plates of
approximately 60 colonies. Replicas of these plates were made on nitrocellulose filters,
and the plates were then scraped and plasmid DNA prepared and transfected as described
above to identify a positive plate. Bacteria from individual colonies from the nitrocellulose
replica of this plate were grown in 0.2 ml cultures, which were used to obtain plasmid
DNA. The plasmid DNA was then transfected into COS-7 cells as described above. In
this manner, a single clone, clone D-7, was isolated which was capable of inducing
expression of G-CSFR in COS cells. A glycerol stock of bacteria transformed with this G-
CSFR cDNA clone in the expression vector pCAV/NOT (or pDC302) has been deposited
with the American Type Culture Collection, 12301 Parklawn Drive, Rockville, MD 20852,
USA, under accession number 68102.
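
The scale of the pooled screen described above follows from simple arithmetic on the stated plating densities. The sketch below is illustrative only; the function name is hypothetical, and the assumption that a positive pool is replated in full at about 60 colonies per plate is ours rather than a statement in the text.

```python
import math


def plates_needed(total_colonies, colonies_per_plate):
    """Number of plates required to hold the stated number of colonies."""
    return math.ceil(total_colonies / colonies_per_plate)


# Primary screen: about 30,000 colonies at 500-600 colonies per plate (one pool per plate).
PRIMARY_POOLS = (plates_needed(30_000, 600), plates_needed(30_000, 500))

# Second round (assumption): one positive pool of 500-600 colonies replated at about 60 per plate.
SECONDARY_PLATES = (plates_needed(500, 60), plates_needed(600, 60))

if __name__ == "__main__":
    print("primary pools:", PRIMARY_POOLS)
    print("secondary plates per positive pool:", SECONDARY_PLATES)
```

Under these figures each round reduces the number of candidate clones roughly tenfold, which is why two rounds of pooling followed by testing of individual colonies suffice to reach a single clone.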

An additional cDNA clone encoding G-CSFR was isolated from the same placental
library. Recombinants from the placental cDNA library were plated on E. coli strain DH5α
and transformants selected on ampicillin plates. The transformants were screened by
plaque hybridization techniques under conditions of high stringency (63°C, 0.2X SSC)
using a 32P-labeled probe made from the human G-CSFR clone D-7. A hybridizing clone
(clone 25-1) was isolated which is identical to clone D-7, except that it contains an intron
insertion after nucleotide 2411, adding nucleotides 2412-2832 of Figure 6 and resulting in
a change in reading frame and a corresponding change in amino acid sequence. The 3'
nucleotide sequence and predicted C-terminal amino acid sequence of clone 25-1 are set
forth in Figure 6.

Example 3 

Construction of cDNAs Encoding Soluble Human G-CSFR

Soluble human G-CSFR was cloned into the mammalian expression vector
pDC302, described above, utilizing the polymerase chain reaction (PCR) technique. The
following primers were used:

5' End Primer

5'- GGTACCATGG CAAGGCTGGGAAAC
Asp718 site/Initiation Codon

3' End Primer

5'- TCTAGA ACTCAGCCTCGATGTG
BglII/Termination Codon


The PCR product thus contains Asp718 and BglII restriction sites at the 5' and 3' termini,
respectively. These restriction sites are used to clone into pDC302. The 3' sequence is
antisense relative to sequence disclosed in Figures 2-5. The template for the PCR reaction
is clone 25-1, described above, which contains the G-CSFR. The DNA sequences
encoding the G-CSFR are then amplified by PCR, substantially as described by Innis et
al., eds., PCR Protocols: A Guide to Methods and Applications (Academic Press, 1990).
The resulting amplified clone was then isolated and ligated into pDC302 and expressed in
monkey COS-7 cells as described above.
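
Because the 3' end primer is antisense, it must be reverse-complemented before it can be compared with the sense-strand listing in the figures. The following sketch performs that conversion and checks the 5' end primer for the Asp718 recognition sequence (GGTACC) and the initiation codon. It assumes both primers are written 5' to 3' with the spacing removed; the constant and function names are illustrative.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


# Primer sequences from the text, with spacing removed (orientation assumed 5' to 3').
FIVE_PRIME_PRIMER = "GGTACCATGGCAAGGCTGGGAAAC"
THREE_PRIME_PRIMER = "TCTAGAACTCAGCCTCGATGTG"

ASP718_SITE = "GGTACC"

# Sense-strand reading of the antisense 3' end primer.
THREE_PRIME_SENSE = reverse_complement(THREE_PRIME_PRIMER)

if __name__ == "__main__":
    print("Asp718 site at 5' end:", FIVE_PRIME_PRIMER.startswith(ASP718_SITE))
    print("ATG follows the site:", FIVE_PRIME_PRIMER[6:9] == "ATG")
    print("3' primer, sense strand:", THREE_PRIME_SENSE)
```

The sense-strand reading begins CAC ATC, the same two codons (His Ile) that close the FIG. 4 listing at nucleotides 2043-2048; this is offered only as an observation, since the text does not state the annealing position.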

Example 4

Preparation of Monoclonal Antibodies to G-CSFR

Preparations of purified recombinant G-CSFR, for example, human G-CSFR, or
transfected COS cells expressing high levels of G-CSFR are employed to generate
monoclonal antibodies against G-CSFR using conventional techniques, for example, those
disclosed in U.S. Patent 4,411,993. Such antibodies are likely to be useful in interfering
with G-CSF binding to G-CSF receptors, for example, in ameliorating toxic or other
undesired effects of G-CSF, or as components of diagnostic or research assays for G-CSF
or soluble G-CSF receptor.

To immunize mice, G-CSFR immunogen is emulsified in complete Freund's
adjuvant and injected in amounts ranging from 10-100 μg subcutaneously into Balb/c mice.
Ten to twelve days later, the immunized animals are boosted with additional immunogen
emulsified in incomplete Freund's adjuvant and periodically boosted thereafter on a weekly
to biweekly immunization schedule. Serum samples are periodically taken by retro-orbital
bleeding or tail-tip excision for testing by dot-blot assay (antibody sandwich) or ELISA
(enzyme-linked immunosorbent assay). Other assay procedures are also suitable.
Following detection of an appropriate antibody titer, positive animals are given an
intravenous injection of antigen in saline. Three to four days later, the animals are
sacrificed, splenocytes harvested, and fused to the murine myeloma cell line NS1.
Hybridoma cell lines generated by this procedure are plated in multiple microtiter plates in a
HAT selective medium (hypoxanthine, aminopterin, and thymidine) to inhibit proliferation
of non-fused cells, myeloma hybrids, and spleen cell hybrids.

Hybridoma clones thus generated can be screened by ELISA for reactivity with G-
CSFR, for example, by adaptations of the techniques disclosed by Engvall et al.,
Immunochem. 8:871 (1971) and in U.S. Patent 4,703,004. Positive clones are then
injected into the peritoneal cavities of syngeneic Balb/c mice to produce ascites containing
high concentrations (>1 mg/ml) of anti-G-CSFR monoclonal antibody. The resulting
monoclonal antibody can be purified by ammonium sulfate precipitation followed by gel
exclusion chromatography, and/or affinity chromatography based on binding of antibody to
Protein A of Staphylococcus aureus.

CLAIMS

1. An isolated DNA sequence encoding a biologically active mammalian G-CSF
receptor (G-CSFR) protein.

2. A DNA sequence according to claim 1, selected from the group consisting of:

(a) cDNA clones having a nucleotide sequence derived from the coding
region of a native mammalian G-CSFR gene;

(b) DNA sequences capable of hybridization to the clones of (a) under
moderately stringent conditions (50°C, 2 x SSC) and which encode biologically active G-
CSFR molecules; and

(c) DNA sequences which are degenerate as a result of the genetic code to
the DNA sequences defined in (a) and (b) and which encode biologically active G-CSFR
molecules.

3. An isolated DNA sequence according to claim 1, encoding a soluble biologically
active mammalian G-CSFR.

4. A recombinant expression vector comprising a DNA sequence according to
claim 1.

5. A recombinant expression vector comprising a DNA sequence according to
claim 2.

6. A recombinant expression vector comprising a DNA sequence according to
claim 3.

7. A process for preparing a mammalian G-CSF receptor or an analog thereof,
comprising culturing a suitable host cell comprising a vector according to claim 4 under
conditions promoting expression.

8. A purified biologically active mammalian G-CSF receptor composition.

9. A purified biologically active mammalian G-CSF receptor composition
according to claim 8, consisting essentially of human G-CSF receptor.




10. A composition for regulating immune or inflammatory responses in a mammal, 
comprising an effective amount of a mammalian G-CSF receptor protein composition 
according to claim 8, and a suitable diluent or carrier. 

11. A method for regulating immune responses in a mammal, comprising
administering an effective amount of a composition according to claim 10.

12. An assay method for detection of G-CSF or G-CSF receptor molecules or the
interaction thereof, comprising use of a protein composition according to claim 8.

13. Antibodies immunoreactive with mammalian G-CSF receptors.

FIG. 3

CAG AAT ATG GGC ATC TGG GTG CAG GCA GAG AAT GCG CTG GGG ACC AGC 800 
Gin Asn Met Gly He Trp Val Gin Ala Glu Asn Ala Leu Gly Thr Ser 
175 180 185 

ATG TCC CCA CAA CTG TGT CTT GAT CCC ATG GAT GTT GTG AAA CTG GAG 848 
Met Ser Pro Gin Leu Cys Leu Asp Pro Met Asp Val Val Lys Leu Glu 
190 195 200 

CCC CCC ATG CTG CGG ACC ATG GAC CCC AGC CCT GAA GCG GCC CCT CCC 896 
Pro Pro Met Leu Arg Thr Met Asp Pro Ser Pro Glu Ala Ala Pro Pro 
205 210 215 220 

CAG GCA GGC TGC CTA CAG CTG TGC TGG GAG CCA TGG CAG CCA GGC CTG 944 
Gin Ala Gly Cys Leu Gin Leu Cys Trp Glu Pro Trp Gin Pro Gly Leu 
225 230 235 

CAC ATA AAT CAG AAG TGT GAG CTG CGC CAC AAG CCG CAG CGT GGA GAA 992 
His He Asn Gin Lys Cys Glu Leu Arg His Lys Pro Gin Arg Gly Glu 
240 245 250 

GCC AGC TGG GCA CTG GTG GGC CCC CTC CCC TTG GAG GCC CTT CAG TAT 1040 
Ala Ser Trp Ala Leu Val Gly Pro Leu Pro Leu Glu Ala Leu Gin Tyr 
255 260 265 

GAG CTC TGC GGG CTC CTC CCA GCC ACG GCC TAC ACC CTG CAG ATA CGC 1088 
Glu Leu Cys Gly Leu Leu Pro Ala Thr Ala Tyr Thr Leu Gin He Arg 
270 275 280 

TGC ATC CGC TGG CCC CTG CCT GGC CAC TGG AGC GAC TGG AGC CCC AGC 1136 
Cvs He Arg Trp Pro Leu Pro Gly His Trp Ser Asp Trp Ser Pro Ser 
285 290 295 300 

CTG GAG CTG AGA ACT ACC GAA CGG GCC CCC ACT GTC AGA CTG GAC ACA 1184 
Leu Glu Leu Arg Thr Thr Glu Arg Ala Pro Thr Val Arg Leu Asp Thr 
305 310 315 

TGG TGG CGG CAG AGG CAG CTG GAC CCC AGG ACA GTG CAG CTG TTC TGG 1232 
Trp Trp Arg Gin Arg Gin Leu Asp Pro Arg Thr Val Gin Leu Phe Trp 
320 325 330 

AAG CCA GTG CCC CTG GAG GAA GAC AGC GGA CGG ATC CAA GGT TAT GTG 1280 
Lys Pro Val Pro Leu Glu Glu Asp Ser Gly Arg He Gin Gly Tyr Val 
335 340 345 

GTT TCT TGG AGA CCC TCA GGC CAG GCT GGG GCC ATC CTG CCC CTC TGC 1328 
Val Ser Trp Arg Pro Ser Gly Gin Ala Gly Ala He Leu Pro Leu Cys 
350 355 360 

AAC ACC ACA GAG CTC AGC TGC ACC TTC CAC CTG CCT TCA GAA GCC CAG 137 6 
Asn Thr Thr Glu Leu Ser Cys Thr Phe His Leu Pro Ser Glu Ala Gin 
365 370 375 380 

GAG GTG GCC CTT GTG GCC TAT AAC TCA GCC GGG ACC TCT CGC CCC ACC 1424 
Glu Val Ala Leu Val Ala Tyr Asn Ser Ala Gly Thr Ser Arg Pro Thr 
385 390 395 
FIG. 4 

CCG GTG GTC TTC TCA GAA AGC AGA GGC CCA GCT CTG ACC AGA CTC CAT 1472 
Pro Val Val Phe Ser Glu Ser Arg Gly Pro Ala Leu Thr Arg Leu His 
400 405 410 

GCC ATG GCC CGA GAC CCT CAC AGC CTC TGG GTA GGC TGG GAG CCC CCC 1520 
Ala Met Ala Arg Asp Pro His Ser Leu Trp Val Gly Trp Glu Pro Pro 
415 420 425 

AAT CCA TGG CCT CAG GGC TAT GTG ATT GAG TGG GGC CTG GGC CCC CCC 1568 
Asn Pro Trp Pro Gln Gly Tyr Val Ile Glu Trp Gly Leu Gly Pro Pro 
430 435 440 

AGC GCG AGC AAT AGC AAC AAG ACC TGG AGG ATG GAA CAG AAT GGG AGA 1616 
Ser Ala Ser Asn Ser Asn Lys Thr Trp Arg Met Glu Gln Asn Gly Arg 
445 450 455 460 

GCC ACG GGG TTT CTG CTG AAG GAG AAC ATC AGG CCC TTT CAG CTC TAT 1664 
Ala Thr Gly Phe Leu Leu Lys Glu Asn Ile Arg Pro Phe Gln Leu Tyr 
465 470 475 

GAG ATC ATC GTG ACT CCC TTG TAC CAG GAC ACC ATG GGA CCC TCC CAG 1712 
Glu Ile Ile Val Thr Pro Leu Tyr Gln Asp Thr Met Gly Pro Ser Gln 
480 485 490 

CAT GTC TAT GCC TAC TCT CAA GAA ATG GCT CCC TCC CAT GCC CCA GAG 1760 
His Val Tyr Ala Tyr Ser Gln Glu Met Ala Pro Ser His Ala Pro Glu 
495 500 505 

CTG CAT CTA AAG CAC ATT GGC AAG ACC TGG GCA CAG CTG GAG TGG GTG 1808 
Leu His Leu Lys His Ile Gly Lys Thr Trp Ala Gln Leu Glu Trp Val 
510 515 520 

CCT GAG CCC CCT GAG CTG GGG AAG AGC CCC CTT ACC CAC TAC ACC ATC 1856 
Pro Glu Pro Pro Glu Leu Gly Lys Ser Pro Leu Thr His Tyr Thr Ile 
525 530 535 540 

TTC TGG ACC AAC GCT CAG AAC CAG TCC TTC TCC GCC ATC CTG AAT GCC 1904 
Phe Trp Thr Asn Ala Gln Asn Gln Ser Phe Ser Ala Ile Leu Asn Ala 
545 550 555 

TCC TCC CGT GGC TTT GTC CTC CAT GGC CTG GAG CCC GCC AGT CTG TAT 1952 
Ser Ser Arg Gly Phe Val Leu His Gly Leu Glu Pro Ala Ser Leu Tyr 
560 565 570 

CAC ATC CAC CTC ATG GCT GCC AGC CAG GCT GGG GCC ACC AAC AGT ACA 2000 
His Ile His Leu Met Ala Ala Ser Gln Ala Gly Ala Thr Asn Ser Thr 
575 580 585 

GTC CTC ACC CTG ATG ACC TTG ACC CCA GAG GGG TCG GAG CTA CAC ATC 2048 
Val Leu Thr Leu Met Thr Leu Thr Pro Glu Gly Ser Glu Leu His Ile 
590 595 600 

ATC CTG GGC CTG TTC GGC CTC CTG CTG TTG CTC ACC TGC CTC TGT GGA 2096 
Ile Leu Gly Leu Phe Gly Leu Leu Leu Leu Leu Thr Cys Leu Cys Gly 
605 610 615 620 
FIG. 5 

ACT GCC TGG CTC TGT TGC AGC CCC AAC AGG AAG AAT CCC CTC TGG CCA 2144 
Thr Ala Trp Leu Cys Cys Ser Pro Asn Arg Lys Asn Pro Leu Trp Pro 
625 630 635 

AGT GTC CCA GAC CCA GCT CAC AGC AGC CTG GGC TCC TGG GTG CCC ACA 2192 
Ser Val Pro Asp Pro Ala His Ser Ser Leu Gly Ser Trp Val Pro Thr 
640 645 650 

ATC ATG GAG GAG GAT GCC TTC CAG CTG CCC GGC CTT GGC ACG CCA CCC 2240 
Ile Met Glu Glu Asp Ala Phe Gln Leu Pro Gly Leu Gly Thr Pro Pro 
655 660 665 

ATC ACC AAG CTC ACA GTG CTG GAG GAG GAT GAA AAG AAG CCG GTG CCC 2288 
Ile Thr Lys Leu Thr Val Leu Glu Glu Asp Glu Lys Lys Pro Val Pro 
670 675 680 

TGG GAG TCC CAT AAC AGC TCA GAG ACC TGT GGC CTC CCC ACT CTG GTC 2336 
Trp Glu Ser His Asn Ser Ser Glu Thr Cys Gly Leu Pro Thr Leu Val 
685 690 695 700 

CAG ACC TAT GTG CTC CAG GGG GAC CCA AGA GCA GTT TCC ACC CAG CCC 2384 
Gln Thr Tyr Val Leu Gln Gly Asp Pro Arg Ala Val Ser Thr Gln Pro 
705 710 715 

CAA TCC CAG TCT GGC ACC AGC GAT CAG GCT GGG CCT CCC AGG CGA TCT 2432 
Gln Ser Gln Ser Gly Thr Ser Asp Gln Ala Gly Pro Pro Arg Arg Ser 
720 725 730 

GCA TAC TTT AAG GAC CAG ATC ATG CTC CAT CCA GCC CCA CCC AAT GGC 2480 
Ala Tyr Phe Lys Asp Gln Ile Met Leu His Pro Ala Pro Pro Asn Gly 
735 740 745 

CTT TTG TGC TTG TTT CCT ATA ACT TCA GTA TTG TAA ACTAGTTTTT 2526 
Leu Leu Cys Leu Phe Pro Ile Thr Ser Val Leu 
750 755 

GGTTTGCAAA AAAAAAAAAA 